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Abstract 

In the online packet buffering problem (also known as 
the unweighted FIFO variant of buffer management), we 
focus on a single network packet switching device with 
several input ports and one output port. This device 
forwards unit-size, unit-value packets from input ports 
to the output port. Buffers attached to input ports may 
accumulate incoming packets for later transmission; if 
they cannot accommodate all incoming packets, the ex- 
cess is lost. A packet buffering algorithm has to choose 
from which buffers to transmit packets in order to min- 
imize the number of lost packets and thus maximize the 
throughput. 

We present a tight lower bound of e/(e — 1) « 
1.582 on the competitive ratio of the throughput maxi- 
mization, which holds even for randomized algorithms. 
This improves the previously best known lower bound 
of 1.4659 and matches the performance of the algo- 
rithm Random Schedule. Our result contradicts the 
claimed performance of the Random Permutation al- 
gorithm; we point out a flaw in its original analysis. 

Keywords: online algorithms, packet buffering, buffer 
management, multi-queue switches. 

1 Introduction 

We study the unweighted FIFO variant of the 
buffer management problem introduced by Kessel- 
man et al. [KLM+04]. In this problem, also called 
the packet buffering problem [AS05], we focus on a sin- 
gle network packet switching device which has m input 
ports and one output port. The goal of such device is to 
forward packets from its input ports to the output port. 
The burstiness of the incoming traffic motivates the use 
of buffers which can accumulate incoming packets and 
store them for later transmission. We assume that all 
packets are of unit size and that each input port has 
an attached buffer able to store up to B packets. We 
consider the unweighted case, in which all packets are 
equally important. 
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Time is slotted in the following way. At any 
(integer) time t > 0, any number of unit-sized packets 
may arrive at the input ports and they are appended to 
the appropriate buffers. If a buffer cannot accommodate 
all the packets, the excess is lost. Then, during the 
time step corresponding to time interval {t,t+ 1), the 
device can transmit a single packet from a single buffer. 
The key difficulty of this problem is that it is inherently 
online: the buffer managing algorithm does not know 
where packets will be injected in the (nearest) future. 
The goal is to minimize the number of lost packets; 
the only decision made by the algorithm at time t 
is choosing the buffer from which a packet will be 
transmitted in step (t, t + 1). 

In our setting, no information about the future is 
available to the algorithm, i.e., the decision of an online 
algorithm at time t has to be made solely on the basis 
of the input sequence up to time t. In particular, 
we make no probabilistic assumptions about the input, 
and assume it is created in a worst-possible manner 
by an adversary. For analyzing the efhciency of online 
algorithms, we use competitive analysis [BE98], and — 
on any input sequence — compare the throughput (the 
number of packets transmitted) of an algorithm and the 
optimal offline schedule. The supremum (taken over all 
inputs) of the ratios between these two values is called 
the competitive ratio of the algorithm and is subject to 
minimization. An algorithm is called i?-competitive if 
its competitive ratio is at most R. In case of randomized 
online algorithms, in the definition above, we replace the 
throughput by its expected value. 

The queuing model considered in this paper is typ- 
ical for input-queued switches or routers, which are the 
dominant packet switching architecture of the Internet. 
Moreover, despite the popularity of theoretical research 
on Quality-of-Service solutions (cf. Section 1.2), most 
of the current networks (most notably those using IP 
protocol) provide only "best effort" services, where all 
packets are treated as equally important and do not 
have to respect any deadlines. The unweighted vari- 
ant considered hereby is therefore typical for such net- 
works. Minimizing the packet loss is an important issue, 
as subsequent packet retransmissions by the standard 



transport-layer protocols like TCP are quite costly, and 
may lead to performance degradation. Finally, there is 
an observable evidence that the network traffic exhibit 
the so-called self-similar properties [LTWW94], which 
renders analyses based on stochastic queuing theory in- 
applicable. Instead, we apply the worst-case, competi- 
tive analysis in place of stochastic assumptions. 

1.1 Previous Work. The contributions on this topic 
fall into three main categories: the results for determin- 
istic algorithms, randomized algorithms and (determin- 
istic) algorithms for the fractional model. In the frac- 
tional model (described in detail in Section 2.2), the 
algorithm may transmit arbitrary fractions of packets, 
as long as the total load transmitted in a single step is 
at most 1. It is a simple observation that any random- 
ized algorithm can be transformed into a deterministic 
fractional one without increasing its competitive ratio, 
i.e., the fractional model is "easier" for algorithms and 
"harder" for the adversary. A relaxed relation in the 
opposite direction also exists: Azar and Litichevskey 
showed how to transform a c-competitive fractional 
solution into a deterministic c • (1 + [i7,„ + \\/B)- 
competitive one [AL06]. 

First, we describe results on randomized and frac- 
tional solutions. The currently best lower bound of 
1.4659 (holding for for arbitrary buffers sizes B and 
large m) on the competitive ratio of any random- 
ized algorithm is due to Albers and Schmidt [AS05]. 
In Section 2.3, we highlight the details of their ap- 
proach and show that in fact it also works for the frac- 
tional scenario. Azar and Richtcr presented the ran- 
domized algorithm Random Schedule [AR05], whose 
competitive ratio is ^^ + o(l) ~ 1.582. Azar and 
Litichevskey [AL06] showed how to encode the frac- 
tional variant of the problem as an instance of the on- 
line fractional matching in a bipartite graph. By con- 
structing an ^^-competitive solution (based on a nat- 
ural "water level" approach) of the latter problem, they 
obtained a fractional ^^-competitive algorithm. In our 
paper, we refer to it as Frac-Waterlevel. Finally, 
this bound was claimed to be improved by the Ran- 
dom Permutation algorithm [Sch05] with competitive 
ratio 1.5 for all values of B and m. These competi- 
tive ratios were improved for the particular case of two 
buffers and any B, where an algorithm attaining the 
optimal competitive ratio of 16/13 « 1.231 was given 
by Bienkowski and M^dry [BM08] . 

As for the deterministic algorithms, the general 
upper bound holding for all values of B and m was 
given by Azar and Richter [AR05]. They proved that 
any work- conserving (i.e., serving a non-empty queue) 
algorithm is 2-competitive. They also showed the 



lower bound of 1.366 — Q{l/m) holding for any B. 
Albers and Schmidt [AS05] improved that bound to 
-^^ for 771 ^ B. They also showed the algorithm 
Semi- Greedy which is 1.889-competitive for large B. 
By applying the fractional-to-deterministic reduction 
described above to the algorithm Frac-Waterlevel, 
Azar and Litichevskey obtained a deterministic ^^j • 
(1 + [iJ„j + IJ /i?)-competitive algorithm (which we will 
call Det-Waterlevel) [AL06]. Again, the results 
can be improved for particular cases: when B = 2, 
then the Semi- Greedy algorithm achieves an optimal 
competitiveness of 13/7 w 1.857 [AS05]; when m = 2, 
the optimal competitiveness of 16/13 « 1.231 was 
achieved by the Segmental Greedy algorithm by 
Kobayashi et al. [KMO08]. 

Most of the algorithms described above were eval- 
uated experimentally in the paper of Albers and Ja- 
cobs [A J 10]. 

1.2 Related Work. The throughput maximization 
problem considered in the unweighted version in our 
paper was also studied in a more general context, where 
packets may have different weights and the goal is 
to maximize the the total transmitted weight. Such 
an extension tries to capture differences of importance 
between various data streams. Upon packet injection, 
the buffer managing algorithm makes a decision whether 
a packet should be accepted to the buffer or dropped 
immediately. Afterwards, accepted packets have to be 
transmitted in FIFO order. Such a problem is non- 
trivial even for the task of managing a single input 
buffer [AMRR05, AM03, AMZ03, Zhu04]. There is also 
a preemptive variant were packets may be dropped from 
the queue [And05, AMZ03, EW09, KLM+04, KM03, 
KMvS05, LP03, MPL04, Zhu04]. 

Another variant of the buffer management problem 
is the so-called bounded-delay scenario, where neither 
the buffer has a fixed capacity nor FIFO order is 
imposed. Instead, each packet specifies a deadline and 
it must be either transmitted before the deadline or 
dropped [AMZ03, CCF+06, CF03, EW07, HajOl, JezlO, 
KLM+04, KMvS05, LSS05, LSS05]. 

For a comprehensive description of these and related 
models, we refer interested readers to the recent survey 
by Goldwasser [GoUO]. 

1.3 Our Contribution. In this paper, we present 
a construction which shows that no fractional algorithm 
may achieve a competitive ratio lower than e/(e — 1) ~ 
1.582 for any value of B and for large m. Our result has 
a few implications: 

• The result is up to lower-order terms optimal; 
it matches the performance of the fractional al- 



gorithm Frac-Waterlevel of [AL06]. It also 
gives evidence that the reduction of Azar and 
Litichevskey from the fractional packet buffering to 
online fractional bipartite matching was essentially 
tight. Even though the fractional matching is more 
general, the competitive ratios achievable for both 
problems are the same. 

• The same lower bound holds for randomized al- 
gorithms. Thus, it improves the currently best 
lower bound of 1.4659 (also holding for any B and 
large m) [AS05]. It is also up to lower-order terms 
optimal for randomized algorithms, as it matches 
the performance of the Random Schedule algo- 
rithm [AR05]. 

• The lower bound contradicts the claimed competi- 
tive ratio of 1.5 of the Random Permutation al- 
gorithm [Sch05]. In the appendix, we pinpoint the 
subtle, yet fundamental flaw in the original analy- 
sis of this algorithm. Roughly speaking, the main 
source of the flaw was neglecting certain types of 
adversarial strategies; these strategies are actually 
employed in our lower bound, cf. Section 1.4. 

• The lower bound of c/(e — 1) for deterministic al- 
gorithm given by Albers and Schmidt [AS05] can 
be applied only when m ^ B. Our construc- 
tion yields the same ratio, but requires only that 
TO is large; B may be arbitrary, e.g., even larger 
than m. In particular, in contrast to their lower 
bound construction, ours shows that the determin- 
istic algorithm Det-Waterlevel [AL06] achieves 
an asymptotically optimal competitive ratio when 
both TO is large and B 3> log to. 

1.4 Used Techniques. While the formal construc- 
tion of the lower bound is given in Section 2 and 
Section 3, here wc informally describe three key ingre- 
dients. 

First, our adversarial strategy creates a packet 
injection sequence which at time completely fills all 
the buffers and can be served losslessly by the optimal 
algorithm. Moreover, after each injection, all buffers of 
the optimal algorithm are full. These assumptions are 
rather standard and we list them only for completeness. 

Second, wc observe that in some cases delaying 
packet injections and injecting multiple packets at once 
incurs a greater packet loss for the algorithm. To give 
a specific example for a randomized algorithm, assume 
that the buffer size B is 1 and at time t, the expected 
number of packets at each buffer is quite low. Then, 
injecting a packet at times i, i -I- 1, and t -|- 2 is more 
benign to the algorithm than not injecting a packet at 



time t, injecting two packets (to two different buffers, bi 
and bj) at time t+1 and injecting a packet at time t + 2. 
At first glance, the former approach tries to incur a loss 
of the algorithm more aggressively, while the latter gives 
the algorithm more time to prepare. However, the latter 
approach has a main advantage for an adversary: at 
time t -I- 1, both buffers bi and bj become full. This 
knowledge can be exploited by the adversary, which 
may inject a packet at time i -I- 2 to one of these two 
buffers. On expectation, the incurred loss caused by this 
injection is then at least 1/2. Although the observation 
presented above may seem simple, it was not exploited 
by previous lower bounds. Moreover, neglecting these 
types of effects is the main source of the flawed analysis 
of [Sch05]. 

Third, our construction is an iterative approach 
which, given an adversarial strategy S for n initially 
full buffers reducing the average load of the buffers to 
a specific level, constructs a strategy S' for n^ buffers, 
which uses the former strategy as a black box. The 
strategy 5" uses only slightly more steps (in comparison 
to the number of buffers) and reduces the average load 
to even smaller amount than S does. By applying the 
construction iteratively, in the limit we obtain a strategy 
for M buffers which reduces the average load of the 
buffers to o(l) in time M ■ {e - 1) + o{M). Hence, 
neglecting the lower-order terms, the throughput of the 
algorithm on such an injection pattern is M • (e — 1). 
On the other hand, right after the last injection, the 
buffers of the optimal solution are still full, and thus its 
throughput is M • (e — 1) + A/ = A/ • e. This implies 
an asymptotic lower bound of e/(c — 1). 

2 The Basics 

2.1 PreUminaries. Throughout the paper, to de- 
notes the number of buffers and B the size of a single 
buffer. We denote the set of all buffers by B 

At any integer times t > 0, the adversary may inject 
arbitrary number of packets to arbitrary buffers. It may 
also choose not to inject anything. If a buffer cannot 
accommodate all the packets, the excess is lost. We 
assume that packet injection is instant and therefore 
we distinguish between the state of buffers at t~ (right 
before packets injection at time t) and i+ (right after 
the injection). 

Then, during the time step corresponding to time 
interval (i,t + 1), the algorithm may transmit a single 
packet from the buffer of its choice. We assume 
that after the last packet injection, the algorithm has 
sufficient time to transmit all the packets it still has in 
its buffers. 

For any algorithm A and any sequence a of packets' 
arrivals, we denote the throughput of A on cr by T{^{<j). 



When A is randomized, Tj^{(j) denotes its expected 
throughput. For any algorithm, its competitive ratio is 
defined as supg.{TopT(o')/TALG(c)}, where supremum 
is taken over all possible inputs and Opt is the opti- 
mal offline algorithm. For randomized algorithms, we 
assume oblivious adversaries [BE98], i.e., the ones that 
do not have access to the random bits used by the algo- 
rithm. 

2.2 Fractional Model. It is usually more natural to 
think about lower bounds for deterministic algorithms. 
Therefore, in this paper, we consider a fractional variant 
of the problem, in which a deterministic algorithm may 
transmit fractional amounts of packets. Note that we 
change solely the capabilities of online algorithm, which 
means that: (i) the injected packets are still integral and 
can be injected at integer times only, (ii) an optimal 
offline solution to which our solution is compared is 
chosen among integral solutions. We call the amount of 
packets in a buffer the load of this buffer. For a subset 
of buffers, the load of these buffers is the sum of the 
respective buffers' loads, and the total load is the load 
of all buffers. Further, in each step the load transmitted 
from the buffers is at most 1. The following reduction 
is an easy observation and it is shown in the appendix 
for completeness. 

Lemma 2.1. For any randomized algorithm Alg for 
the integral model with competitive ratio R, there exists 
a deterministic algorithm Alg for the fractional model 
whose ratio is at most R (for the same buffer size and 
number of buffers). 

By Lemma 5.1 of [AL06], without loss of generality, 
we may assume that an algorithm is work-conserving^ 
i.e., in each step it transmits a load of 1 if it has it, 
and its total load otherwise. We silently make this 
assumption in our proofs. 

2.3 Canonical Strategies. By Lemma 2.1, we only 
need to show a lower bound for deterministic algorithms 
in the fractional model. In our construction, for simplic- 
ity of the description, we assume that B = \. We note 
that the presented approach is valid with virtually no 
changes for other values oi B: we only have to replace 
an injection of a single packet by injection of B packets 
to the same buffer and replace each step by B consecu- 
tive steps. 

An elementary building block of the constructed 
input sequence is an action. It is parameterized by 
a subset of buffers A and a positive integer a called 
length. When we say that the adversary executes action 
(A, a) at time i, we mean that nothing is injected during 
time period {t,t + a). Then, at time t-\-a^ the adversary 



chooses a buffers with the maximal load among buffers 
from A and injects a packet to each of these buffers. 
Ties are broken arbitrarily. We call set A active in time 
period {t,t-\-a\ or active for this action, and we say that 
such action operates on set A. The considered action 
starts at time t and ends at time t -\- a. 

A brief characterization of the inputs we construct 
is as follows. First, at time 0, the adversary injects 
a packet to each of m buffers; this is called the initial 
injection. Second, at time 0, it executes a canonical 
(adversarial) strategy. Such a strategy is a sequence 
of actions, where for each consecutive pair of actions, 
the latter starts exactly when the former ends. (For 
technical reasons, which become clear when we describe 
our recursive construction, the initial injection is not 
the part of the canonic strategy.) Further, we define the 
length of a canonical strategy as the sum of its actions' 
lengths; note that the length of a strategy starting at 
time coincides with the last time at which packets are 
injected. We say that something happens during the 
strategy of length i executed at r if it happens within 
time interval (r, r + £] . 

In our construction, we consider canonical strategies 
only. Canonical strategies are neat to analyze, because 
the optimal algorithm can serve the corresponding 
inputs losslessly. 

Lemma 2.2. For a canonical adversarial strategy of 
length £ preceded by the initial injection, the throughput 
of the optimal offline algorithm is m + £. 

Proof. Let (Ai,ai),(A2,a2),...,(Afc,afe) be the canon- 
ical strategy. For any 1 < j < fc, during Oj time steps of 
action j, Opt transmits packets from these aj buffers 
to which a packet is injected at time X]i=i ^i (^^ the 
end of action j). In effect, during the first X]j=i o,i — ^ 
steps. Opt transmits a packet in each step and does not 
lose any. Finally, at time £+, the buffers of Opt are still 
full and it may transmit all m packets afterwards. D 

By the lemma above, we may use a canonical 
strategy to show a lower bound on any algorithm in the 
following way. We show that after a given strategy S of 
length I is executed against any algorithm Alg, the 
total load of Alg at time t^ is at most c. Then, 
during the first I steps, the load transmitted by Alg 
is at most £, and afterwards it is at most c. On the 
other hand, by Lemma 2.2, the throughput of Opt is 
(. -\- m. Hence, S implies that the competitive ratio of 
any algorithm is at least {l + m)/{l + c). Therefore, our 
goal is to construct an adversarial canonical strategy 
which reduces the total load of any algorithm as quickly 
as possible. 



In our construction, for a given number of buffers to, 
we give a strategy where tlie lengtlis of tlie correspond- 
ing actions are fixed (and known to the algorithm). The 
choice of active sets will be algorithm dependent; we de- 
scribe it in detail in the next sections. 

2.4 Uniform Strategies. The most straightforward 
canonical strategies, used previously in the literature, 
are ones in the form {A, 1), {A, 1), . . . , {A, 1), where A is 
a fixed set of buffers. In other words, when executing 
such a strategy, the adversary injects a packet at the 
end of each step into the most populated buffer from 
set A. We call such strategies A-uniform. 

For any action {A, a), we use {A, a) x fc as a short- 
hand for the sequence consisting of k consecutive ac- 
tions {A, a). The behavior of algorithms on S-uniform 
strategies (recall that B is the set of all buffers) was 
investigated previously by Albers and Schmidt [AS05]. 
Hereby, we show a slightly more general result, which 
become useful later: we consider ^-uniform strategies 
for arbitrary sets A. 

Definition 2.1. For any subset A consisting of n 
bujfers and a real (3 > 0, the strategy Sq{A) is the 
canonical uniform strategy {A, 1) x [/3n] . 

Lemma 2.3. Fix a real j3 > 0, any subset ofn buffers A, 
and let £ be the length of the strategy Sq{A). Assume 
that at a step t, all buffers of A are full and the 
adversary starts to execute the strategy Sq{A). Let E 
be the load transmitted during Sq{A) from buffers not 
in A. Then the load in A at time (t -\- i)~^ is at most 
E + n-e-'^ + 1. 

Proof. For simplifying the notation, wc assume that 
r = 0. For any integer t > 0, let Ct be the load of A 
at time t^, and for t > 1, let at £ [0,1] be the load 
transmitted from A in step {t — l,t). 

Since at each time a packet is injected the total load 
is at least 1, the algorithm is able to transmit the load 
of 1 in each step. Therefore, E = J2t=ii^ ~ ^t)- 

We establish a recursive relation for Ct- Clearly, 
Co = n. Fix any t £ {1, . . . ,£}. At time t~ , the load in 
the most populated buffer of A is at least {Ct-i — at)/n. 
When the adversary injects a packet to this buffer at 
time i, the load in A becomes Ct < (Cf_i — at) ■ (1 — 
l/n) + l < (Ct_i-l)-(l-l/n) + (l-at) + l. Hence, d < 
{n~l)-{l~l/nY + J2l^^{l-at) + l < n-e~^/" + E + l. 
By ^ > Pn, the lemma follows. D 

We remark that the previously best lower bound, 
due to Albers and Schmidt [AS05], was to use the 
strategy Sq{B) with a numerically optimized /? and 
large m. The best choice is /3 « 1.1462, which for 



771, — >■ oo causes any algorithm to have ratio at least 
1.4659. 

3 Lower Bound Construction 

As mentioned in the introduction, the crucial observa- 
tion behind our construction is that sometimes it is ben- 
eficial to delay injections first and then to inject more 
than one packet at once. To give a specific example, 
assume that m = n^ and consider a strategy 

P=(S,n),(Ai,l)xl, 
(B,n),(A2,l)x2, 

(B,n),(A„_i,l)x(n-l), 
(B,n),(A„,l) x n . 

We have to further define active sets Aj of partic- 
ular actions. The sequence P can be divided into 
n phases, the j-th one corresponding to the subsequence 
{B,n),{Aj,l) X j. Aj is the set of buffers to which 
packets are added at the end of action {B, n) of the 
j-th phase. 

Observe that the length of P is roughly l.bn? = 
l.bm. Using the bounds given later in our proofs, 
one can show that already this simple strategy incurs 
a lower bound of 1.5589 on the competitive ratio of 
any algorithm. We may compare P with the S-uniform 
strategy (S, 1) x £ of the same length. By Lemma 2.2, 
the throughput of Opt on both strategies is the same, 
and therefore the key factor determining the efficiency 
is the total load at time ^+. It can be computed 
that the total load at that time is at most 0.104 to for 
the strategy P and at most 0.223 rri for the ;B-uniform 
strategy. The approach illustrated by the strategy P is 
the main building block of our construction. 

Now, wc concentrate on a single part (Aj, 1) x j of 
the strategy P. Wc observe that it is focused entirely 
on decreasing the load in Aj. (The algorithm might 
transmit the load from other buffers as well, but as 
we implicitly show later, it is rather a bad choice.) 
As the strategy P is more efficient in decreasing the 
load than the uniform strategy, why not replace the 
sequence {Aj,l) x j by a P-typc strategy of length 
roughly j operating on the set Ajl Furthermore, why 
not perform such a replacement recursively? Such 
top-down replacements arc viable, but troublesome to 
define. Therefore, we apply a bottom-up approach: we 
fix n, start from a strategy for n buffers and using it as 
a black box, we build a better strategy for n^ buffers. 
By applying such a transformation iteratively, in the 
limit we obtain a desired construction, incurring a lower 
bound of e/(e — 1). 



Sr (on 16 buffers) = 4, S'/'a, SI'\a, S^ = 4, (2, S'/') , 4, (2, S^/') , 4, (2, 5y^ 2, S,^/^ 

= 4,(2,(1)), 4,(2,(1)), 4,(2,(1), 2,(1,1)) 
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Figure 1: Example strategy S2 on 16 buffers identified with capital letters. Sets were omitted from description 
of the strategy and placed on the picture instead. They denote buffers to which the adversary injects packets at 
steps marked with dots. 



3.1 Building Lower Bounds Iteratively. Let n be 

a parameter of our construction. For any integer q > 0, 

T?„. In this section, 



let n„ 



2' 
n , I.e., no 



n and n„ 



for any /3 G (0, 1], we define strategies S^j^i operating on 
Tig+i buffers on the basis of strategies S^ operating on 
Uq buffers. The strategies Sq are the uniform strategies 
of Definition 2.1. 



Definition 3.1. Fix an integer q > 0. For any 
subset A o/riq+i buffers, and any (3 e (0, 1], let 

{A,nq), Sf^-iA^), 



(An,), 5f "'!/"' (Ar^„,l) , 

where Aj is the set of Uq buffers to which packets 
were injected during the action (A, Uq) immediately 
preceding Sq (Aj). 

We say that S^^i{A) operates on set A. Note that 
by the recursive definition above, all actions of such 
strategy operate on A or its subsets. Observe also 
that if m = n^, the strategy Sl{B) coincides with the 
strategy P. 

It is helpful to interpret this adversarial strategy as 
a (program) routine, which calls another subroutines. In 
the example given in Figure 1, routine S'2 operating 
on 2^ 16 buffers {A, . . . , P} calls three subroutines: 



Si on {C,D,F,K}. Within these subroutines, the 
adversary operates on a given set of buffers and ignores 
all the remaining ones. Note that the algorithm is not 
bounded in this way: it may always transmit the load 
from an arbitrary buffer. In the figure, wc used brackets 
to emphasize the recursive structure of the strategy, and 
we omitted sets for succinctness. 

Our construction implicitly builds a hierarchy of 
buffer sets: if a strategy (routine) operating on set A 
calls another strategy (subroutine) operating on set A' , 
then A' C A. Moreover, at the beginning of the 
subroutine, all buffers from A' are full. 

3.2 Technical Relations. Wc present two technical 
facts that wc use in our bounds. They arc both proven 
in the appendix. 

Fact 3.1. For any non-negative integers n > Q, q, k 
and any positive real j3, the following inequality holds: 



■^-^ \,n„ n 



i=i 



fc + 1 



2{q + l] 



k+l 



Fact 3.2. For any non-negative integer q, the following 
inequality holds: 



q h 



EJ2h ^ 1-' 



1 



h=0 fc=0 



(9+1)! 



,1/4 



Si'^ on buffers {A, B, C, D}, Sf on {B, D, E, G}, and 



2/4 



3.3 Bounding the Length of S^. Below, wc bound 
the length of our strategies 5^ . 



Lemma 3.1. Fix integers n > 0, q > 0, a real P S (0, 1], 
and any subset A consisting ofriq buffers. LetT^ he the 
length of the strategy S^ {A) . Then, 



9+1 



TP < 



n. ^ — ' 



(/3 + 2g/n) 



k=\ 



k\ 



Proof. The proof follows by induction on q. For q = 0, 
the right hand side of the inequality is /3n + 1, and thus 
by Definition 2.1, the basis of the induction holds. 

Assume that the bound holds for q, and we prove it 
for q+ 1. By Definition 3.1, it holds that 

r/3n,l 

^'+1 = E i^'^ + Tt') ■ 
i=i 

Using inductive assumption. Fact 3.1 and /3 < 1, we 
derive 



T. 



r/5n,l / , g+1 
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j=i \ k=i 
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which completes the inductive proof. D 

3.4 Bounding the Efficiency of S^. We measure 
the efficiency of strategies S^ in reducing the total load 
in a set of buffers. We assume that the total load never 
drops below 1 as this assures that the algorithm trans- 
mits load 1 in each step, which simplifies the analysis. 
Otherwise, the adversary may stop the sequence as the 
goal of reducing the load was achieved anyway; we post- 
pone the details to the proof of Theorem 3.1. 



Proof. We use induction. As Qo = !/"■ and Fq — 1, 
the induction basis (q = 0) follows immediately by 
Lemma 2.3. We assume that the the bound holds for q 
and prove it for g -I- 1 . 

Recall that S^^i operates on set A consisting 
of Uq+i ~ n^ buffers. We divide this strategy 
into [/3nq] phases, where the j-th phase consists of 
an action {A.riq) followed by the strategy Sq {Aj) 
(cf. Definition 3.1). 

Let t^ — T and let tj be the ending time of 
phase J, i.e., phase j lasts between tj-i and tj. For 
j G {0, . . . , [/3nq]}, let Cj denote the total load in A at 
time f^ . Further, let Ej be the load transmitted during 
phase j from buffers outside of A. 

Let 7 = e"^/"'. We show that the following relation 
holds for any phase j. 



(3.1) C, < 7 • Q-i + (7^' ■ i^,^/"' + Q,+i) • nq + 



E, 



Lemma 3.2. Fix a real j3 S (0, 1], any subset A consist- 
ing of riq buffers, and let (, be the length of the strat- 
egy S^{A). Assume that at a step t, all buffers of A time tt are as follows 
are full and the adversary starts to execute the strategy 
SP{A). Assume that for any time t € {r, t -\- 1, . . . ,t + 
i— 1}, the total load (in all buffers) is at least 1. Let E 
be the load transmitted during S^{A) from buffers not 
in A. Let L^ be the total load in A at time (r + £) + . 
Then, 

L^q<E+ (e-^ • F!^ + Qq) ■ nq , 

where 



To this end, we focus on a single phase j. We partition 
Ej into aj and hj : the loads transmitted from the buffers 
not in A during action {A, Uq) and during sequence 

Sq (Aj), respectively. 

The action (A, Uq) takes place in steps (tj-i, ij-i -I- 
Uq). Since the algorithm is work-conserving, and — by 
the lemma assumption — the total load in all buffers 
never drops below 1. the algorithm transmits the load 
Uq — aj from the buffers of A during these steps. The 
load of A at time (tj-i -\- nq)~ is then Cj-i — Uq -\- aj. 
At time tj^i + nq, the adversary chooses set Aj of Uq 
most populated buffers of A and a packet is injected to 
each buffer of Aj. Hence, at time (ij-i -I- nq)~^ , all the 
buffers of Aj are full and the load in the set A \ Aj is at 

most (Cj-l - 72q + Oj) ■ (1 - Uq/riq+l) = [Cj-l - Uq + 
Qj) ■ (1 - l/riq) = Cj-1 ■ (1 - l/Uq) - nq + 1 + Uj < 
Cj-l ■ -f — Uq + 1 + aj. 

At time tj_i -f Uq the adversary starts to execute 

the strategy Sq (Aj), ending at time tj. Let bj be 
the load transmitted by the algorithm during sequence 

^1/ 



S'q' "(Aj) from buffers A\ Aj. Then, the loads at 



in Aj : at most bj + b'^ + (7^ • Fq 
the inductive assumption); 

in A\ Aj] at most Cj_i • 7 — 



li/"? 



+ Qq) ■ riq (by 



l + aj-b'j. 



The load in A at time f^ is then Cj < Cj-i ■ 7 + (7-' 
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Qq = -q + 



+ Qq) ■ nq - Uq + 1 + Uj + bj < Cj-1 • 7 -|- (7-' • 

+ Q9+1) ■ nq -\- Ej, and thus (3.1) follows. 
By solving the recurrence of (3.1) with the bound- 
ary condition Co = ?iq+i, the following bound on C,. 



follows for any phase r. 

r 

First, we observe that X]^=i 7*^"^ ' -^i < Sj=i -^i ^ ^• 
Second, the coefficient at the (negative) term Qg+i • rig 

is E;=i 7""^' = (1 - 70/(1 - 7) > "9 • (1 - 70- Hence, 
reorganizing terms yields 






< 



+ ^E^r' + (i- 



7 ) ■ Qq+l 



3 = i 



As the number of phases is [/3ng], L^ = C|-/3„^]. Using 
^r/3n<il < e-/3^ _Q^^-|^ < (7 + 1, and further reorganizing 
terms, we obtain 

L^a-E J 1 r^l ., \ 

uq+i y "^1 ~{ J 

Therefore, for proving the lemma, it is sufficient to 
show that the term in bracket is at most iv+i- Indeed, 
applying Fact 3.1 yields 
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which completes the inductive proof. D 

3.5 The Competitive Ratio. When we look at the 
strategies S^ run on the set of n^ initially full buffers, we 
observe that when q is large and n is large in comparison 
to q, then (i) the length of the strategy is approximately 
(e — 1) ■ Uq, and (ii) after executing S]., the total load 
of any algorithm is o{nq). Hence, the throughput of 
the algorithm is roughly (e — 1) • n^, whereas Opt may 
serve the whole sequence losslessly, ending the strategy 
S^ with riq full buffers. Hence, the throughput of Opt 
is approximately c • rig, which yields the desired lower 
bound on the competitive ratio. These intuitions are 
formalized below. 



Proof. Let q > be a parameter which we will fix later. 
Let n = 4(7'^. The number of buffers used is m = Uq. 
At time 0, the adversary performs the initial injection, 
filling all buffers, and then executes the strategy Sq{B) 
against the algorithm Alg (recall that B is the set of all 
buffers). If at any time during this strategy, the total 
load drops below 1, then the adversary ends the input 
sequence immediately. 

We show that for large enough q, the competitive 
ratio of Alg incurred by Sq{B) is arbitrarily close to 
e/(e-l). 

Let £ be the last time when packets were injected. If 
the sequence does not end prematurely (because of the 
load drop), then £ is just the length of Sq{B), otherwise 
it is smaller. In either case, for the analysis, we may 
assume that the algorithm faced a canonic sequence 
that ended at time £. For our choice of n, it holds that 
(1 + 2g/n)9+i < (1 + l/(2g3))29 < gV?" < i + 2/q\ 
Hence, by Lemma 3.1, 
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£<^ + nq-Y: 
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Let c be the total load at time €"*" . If the sequence 
ended prematurely, then since all actions of S^ have 
length at most ng_i, it holds that c < l+ng_i. Now, we 
consider the case when the strategy S^ {B) was executed 
completely. As it operates on all the buffers, the term E 
in the statement of Lemma 3.2 is zero. Hence, by this 
lemma. 



\ h=0 k=0 / 



< Ul + 2/q^)- (q + 
= 0{l/q)-nq , 
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c-(q + 1)! 



q H I • ria 



where the second inequality follows by Fact 3.2. 

The throughput of Alg is at most £ + c. On the 
other hand, by Lemma 2.2, the throughput of Opt is 
£ + Hq. Hence, the competitive ratio of Alg is at least 



R = 



£ + c 



---0(1/9) 
— i 



Theorem 3.1. The competitive ratio of any fractional By taking sufficiently large q, the ratio can become 

algorithm Alg for the unweighted FIFO variant of the arbitrarily close to e/(e — 1). As the choice of Alg 

buffer management problem is at least e/(e — 1). The was arbitrary, no algorithm can achieve the competitive 

bound holds for any value of B and tti — >■ oo. ratio lower than e/(e — 1). D 



Theorem 3.1 combined with Lemm.a 2.1 yields the 
following corollary. 

Corollary 3.1. The competitive ratio of any random- 
ized algorithm Alg for the unweighted FIFO variant of 
the buffer management problem on is at least e/(e — 1). 
The bound holds for any value of B and ttt, — > (X). 

4 Conclusions 

We conclude with a few remarks about the fractional 
model. First of all, the buffer size B does not seem to 
be a relevant factor, at least for all known algorithms 
and lower bounds. Therefore, the fractional model 
seems to be simpler than the standard, integral one and 
developing optimal algorithms might be easier here. 

For the fractional model, we presented a new tight 
lower bound of e/(e— 1), which matches the performance 
of the algorithm Frac-Waterlevel by Azar and 
Litichevskey [AL06]. Although optimal when m tends 
to infinity, the bound is not applicable for smaller values 
of m. For m = 2, the exact competitive ratio of 16/13 is 
achieved by the Perpendicular Bisector algorithm 
of [BM08]; it matches the lower bound generated by 
the uniform strategy of [AS05]. Determining the exact 
competitive ratios for other values of m is an open prob- 
lem. In particular, it is not even clear whether the best 
possible lower bound can be incurred using canonical 
strategies. 
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A Flaw in the Analysis of the Random 
Permutation Algorithm 

The lower bound presented in this paper contradicts the 
claimed performance of the Random Permutation 
algorithm. In this section, we demonstrate an error 
in its original analysis [Sch05]. In terms of our lower 
bound, the main source of the flawed analysis in [Sch05] 
is an assumption that B-uniform strategies are the worst 
for an online algorithm. 

Random Permutation is defined for B = 1 and 
any number m of buffers. It chooses a permutation of 
the buffers uniformly at random. In any time step, it 
transmits a packet from the populated buffer which is 
most to the front in the permutation. 

The first packet arriving at a particular buffer 
is called initializing and all the subsequent packets 
injected to this buffer are non-initializing. We number 
all non-initializing packets. The probability of accepting 



the i-th non-initializing packet is denoted pi. Let 



m\ 



j=i 



i-E ("-iw-i)'- 



Lemma 1 of [Sch05] states that Pi > qi for 1 < i < m, 
provided that Opt can accommodate all non-initializing 
packets. 

In the proof of this lemma, it is argued that the 
worst possible situation occurs when all m initializing 
packets arrive at time and then, for 1 < i < m, the 
i-th non- initializing packet arrives at time i to buffer i. 
We call such an input sequence systematic (these are 
inputs generated by special types of uniform strategies 
used in our construction). Although the lemma holds 
for systematic sequences (it even holds that pi — qi), 
it is not true that the systematic sequences are the 
worst input instances for the Random Permutation 
algorithm. 

Our counterexample to Lemma 1 of [Sch05] is 
extremely simple. Let m ~ 10; m initializing packets 
are injected at time 0. For 1 < i < 7, the i-th non- 
initializing packet is injected at time i to buffer i. At 
time 8, no packets are injected. At time 9, 8-th and 9-th 
non-initializing packets are injected to buffers 8 and 9, 
respectively. At time 10, the 10-th non-initializing 
packet is injected to buffer 8. 

Note that till step 7, our sequence is systematic. 
The probabilities of accepting the 8-th and the 9-th 
packet are indeed slightly higher in our sequence than in 
the systematic one, i.e., up till step 9 claiming that the 
systematic sequence is the worst possible is justified. 
But our construction leaves buffers 8 and 9 full at 
time 9. Thus, the probability of accepting the 10-th 
non- initializing packet is at most 1/2 < qiQ. Note that 
this counterexample to the performance of Random 
Permutation follows the same line of attack as our 
lower bound, i.e., delaying the injections, filling multiple 
buffers at once, and finally injecting a subsequent packet 
to one of them. 

Subsequent parts of the analysis of Random Per- 
mutation [Sch05] use only the relation J2k=iPk ^ 
SfeLi 9fej which can be proved to be false using our 
construction as well. Showing it analytically would re- 
quire tedious calculations and would be in a great part 
a mere repetition of the arguments given in the origi- 
nal proof of [Sch05]. Since our construction is a small 
finite object, it can be verified by a trivial computer pro- 
gram. Such program can be found on the author's web 
page. Here, we just list the relevant values. Clearly, 
Pi = g, for i < 7. Moreover ps = P9 = 2 374 894/10! 
and pio = 1716 050/10!, whereas gg = 2 012 014/101, 
qg = 2 160 343/10!, and qio = 2 293 839/10!. This shows 



that X^fcLiPfe < X^I'Li'Jfcj proving Lemma 1 of [Sch05] 
false. 

B Proofs of Technical Claims 

Proof, (of Lemma 2.1) We fix any buffer size B and 
number of buffers m. 

By Qalg(*) and (5alg'(*) '^•^ denote the amount 
of packets in buffer i of algorithms Alg and Alg', 
respectively. The proof follows by simulation, i.e., we 
show that on the basis of Alg, it is possible to construct 
an algorithm Alg , such that — at any time of any input 
sequence — the following invariant holds for any buffer i: 

Qalc.'(«)>E[Oalg(i)] , 

and for any step where Alg transmits p packets in 
expectation, Alg' transmits load p as well. This 
implies that — on any input sequence — the throughput 
of Alg' is at least that of Alg, which yields the lemma. 
Clearly, the invariant holds at the beginning, when 
both Alg and Alg' have no packets. Then the proof 
follows inductively, by considering two possible types of 
events. 

1. A packet is injected to buffer i. If QalgK*) ^ B — 1 
before the injection, then afterwards QK-LG'{i) = B. 
Otherwise, QALG'(i) increases by 1 and E[(5alg(«)] 
increases at most by 1. In both cases, the invariant 
is preserved. 

2. Packets are transmitted during a step. Let (5alg(*) 
be the number of packets in buffer i after the 
transmission of Alg. Let qi = E[(5ALG(i) ~ 
Qalg(0]- Clearly, g^ > for aU i and X)"=li 9j < 1- 
To simulate this behavior of Alg and preserve the 
invariant, Alg transmits load Qi from buffer i. 



Proof (of Fact 3.1) 
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Proof, (of Fact 3. 2) We start with bounding a related 
term. 
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Now, we use the bound above to prove the desired fact. 
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